Counting colorings of a regular graph 



David Garvin* 
March 3, 2013 

Abstract 

At most how many (proper) $q$-colorings does a regular graph admit? Galvin
and Tetali conjectured that among all $n$-vertex, $d$-regular graphs with $2d \mid n$, none
admits more $q$-colorings than the disjoint union of $n/2d$ copies of the complete
bipartite graph $K_{d,d}$. In this note we give asymptotic evidence for this conjecture,
showing that the number of proper $q$-colorings admitted by an $n$-vertex, $d$-regular
graph is at most

\[ (q^2/4)^{n/2} \binom{q}{q/2}^{n(1+o(1))/2d} \quad \text{if $q$ is even,} \]

\[ ((q^2-1)/4)^{n/2} \binom{q+1}{(q+1)/2}^{n(1+o(1))/2d} \quad \text{if $q$ is odd,} \]

where $o(1) \to 0$ as $d \to \infty$; these bounds agree up to the $o(1)$ terms with the
counts of $q$-colorings of $n/2d$ copies of $K_{d,d}$.

An auxiliary result is an upper bound on the number of colorings of a regular
graph in terms of its independence number. For example, we show that for all
even $q$ and fixed $\varepsilon > 0$ there is $\delta = \delta(\varepsilon, q)$ such that the number of proper
$q$-colorings admitted by an $n$-vertex, $d$-regular graph with no independent set of
size $n(1-\varepsilon)/2$ is at most

\[ (q^2/4 - \delta)^{n/2}, \]

with an analogous result for odd $q$.



1 Introduction 

Throughout, $G$ is a simple, finite loopless graph, and $q$ is a positive integer. A proper
$q$-coloring (or just $q$-coloring) of $G$ is a function from the vertices of $G$ to $\{1, \dots, q\}$
with the property that adjacent vertices have different images. We write $c_q(G)$ for the
number of $q$-colorings of $G$.
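
To make the definition concrete, here is a minimal brute-force sketch (not part of the original note) that counts proper $q$-colorings of a small graph given as an edge list. The name `count_colorings` and the vertex labelling 0, ..., n-1 are illustrative assumptions, and the enumeration takes $q^n$ steps, so it is only meant for tiny examples.

```python
from itertools import product

def count_colorings(n, edges, q):
    """Count proper q-colorings of a graph on vertices 0..n-1 by brute force."""
    return sum(
        all(col[u] != col[v] for u, v in edges)   # every edge sees two different colors
        for col in product(range(q), repeat=n)    # all q^n assignments of colors
    )

# Two small test graphs (hypothetical examples, chosen for illustration).
triangle = [(0, 1), (1, 2), (0, 2)]               # K_3
four_cycle = [(0, 1), (1, 2), (2, 3), (3, 0)]     # C_4, which is also K_{2,2}

print(count_colorings(3, triangle, 3))    # q(q-1)(q-2) at q = 3
print(count_colorings(4, four_cycle, 3))  # (q-1)^4 + (q-1) at q = 3
```

The two printed values can be compared with the standard chromatic polynomials of the triangle and of the 4-cycle.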

The following is a natural extremal enumerative question: for a family $\mathcal{G}$ of graphs,
which $G \in \mathcal{G}$ maximizes $c_q(G)$? For example, for the family of $n$-vertex, $m$-edge graphs
this question was raised independently by Wilf [2, 12] and Linial [8], who both came
across it in their study of running times of coloring algorithms. Although it has only
been answered completely in some very special cases, many partial results have been
obtained (see [9] for a good history of the problem).

The focus of this note is the family $\mathcal{G}(n,d)$ of $n$-vertex $d$-regular graphs with $d \ge 2$
(the case $d = 1$ being trivial). Galvin and Tetali [5] used an entropy argument to show
that for $2d \mid n$ no bipartite $G$ in $\mathcal{G}(n,d)$ admits more $q$-colorings, for each $q \ge 2$, than
$\frac{n}{2d}K_{d,d}$, the disjoint union of $n/2d$ copies of the complete bipartite graph $K_{d,d}$ with $d$
vertices in each partite set. More generally they found $c_q(G) \le c_q(K_{d,d})^{n/2d}$ for all $n$,
$d$ and bipartite $G \in \mathcal{G}(n,d)$ (this is [5, Prop. 1.2] in the special case $H = K_q$), and
they conjectured that this bound should still hold when the bipartiteness assumption is
dropped.

Conjecture 1.1 Fix $d \ge 2$ and $n \ge d+1$. For any $G \in \mathcal{G}(n,d)$ and any $q \ge 2$,

\[ c_q(G) \le c_q(K_{d,d})^{n/2d}. \]

For $q = 2$ this follows immediately from the bipartite case established in [5]. Zhao [14]
established the conjecture for all $q > (2n)^{2n+2}$, and in the case $2d \mid n$ Galvin [3], using
ideas introduced by Lazebnik [7] on a related problem, reduced this to $q > 2\binom{nd/2}{4}$, but
neither the approach of [3] nor that of [14] seems adaptable to the case of constant
$q \ge 3$.
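
As a small sanity check of Conjecture 1.1 (an illustration added here, not an argument from the note), the case $d = 2$ can be tested directly: a 2-regular graph is a disjoint union of cycles, $K_{2,2}$ is the 4-cycle, and the number of proper $q$-colorings of the cycle $C_k$ is given by the standard formula $(q-1)^k + (-1)^k(q-1)$. The helper names below are hypothetical, and the comparison is done in integers by raising both sides to the fourth power.

```python
def cycle_colorings(k, q):
    """Proper q-colorings of the cycle C_k, k >= 3 (standard chromatic polynomial)."""
    return (q - 1) ** k + (-1) ** k * (q - 1)

def satisfies_conjecture_d2(cycle_lengths, q):
    """Check c_q(G) <= c_q(K_{2,2})^(n/4) for G a disjoint union of cycles."""
    n = sum(cycle_lengths)
    count = 1
    for k in cycle_lengths:
        count *= cycle_colorings(k, q)      # colorings multiply over components
    base = cycle_colorings(4, q)            # c_q(K_{2,2}), since K_{2,2} = C_4
    # count <= base^(n/4) is equivalent to count^4 <= base^n (exact integers)
    return count ** 4 <= base ** n

for lengths in ([8], [4, 4], [3, 5]):
    print(lengths, satisfies_conjecture_d2(lengths, 3))
```

This only probes a few 2-regular graphs on eight vertices with $q = 3$; it says nothing about general $d$.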

Conjecture 1.1 is a special case of a more general conjecture concerning graph
homomorphisms. A homomorphism from $G$ to a graph $H$ (which may have loops) is a
map from vertices of $G$ to vertices of $H$ with adjacent vertices in $G$ being mapped to
adjacent vertices in $H$. Homomorphisms generalize $q$-colorings (if $H = K_q$ then the set
of homomorphisms to $H$ is in bijection with the set of $q$-colorings of $G$) as well as other
graph theory notions, such as independent sets. An independent set in a graph is a set
of pairwise non-adjacent vertices; notice that if $H = H_{\mathrm{ind}}$ is the graph on two adjacent
vertices with a loop at exactly one of the vertices, then a homomorphism from $G$ to $H$
may be identified, via the preimage of the unlooped vertex, with an independent set in
$G$. Amending a false conjecture from [5], the following conjecture is made in [3]. Here
we write $\hom(G, H)$ for the number of homomorphisms from $G$ to $H$.

Conjecture 1.2 Fix $d \ge 2$ and $n \ge d+1$. For any $G \in \mathcal{G}(n,d)$ and any finite graph
$H$ (perhaps with loops, but without multiple edges),

\[ \hom(G,H) \le \max\left\{ \hom(K_{d,d}, H)^{n/2d},\ \hom(K_{d+1}, H)^{n/(d+1)} \right\}, \]

where $K_{d+1}$ is the complete graph on $d+1$ vertices.

When $d \ge q$ we have $\hom(K_{d+1}, K_q) = 0$ and so in this range Conjecture 1.2 implies
Conjecture 1.1.

The inspiration for Conjecture 1.2, and the partial result of [5] that the conjecture is
true for all bipartite $G$, was the special case of enumerating independent sets ($H = H_{\mathrm{ind}}$).
In what follows we use $i(G)$ to denote the number of independent sets in $G$. Alon [1]
conjectured that for all $G \in \mathcal{G}(n,d)$ we have

\[ i(G) \le i(K_{d,d})^{n/2d} = (2^{d+1} - 1)^{n/2d} = 2^{n/2 + n(1+o(1))/2d} \]

and proved the weaker bound $i(G) \le 2^{n/2 + O(n/d^{1/10})}$. The sharp bound was proved
for bipartite $G$ by Kahn [6], but it was a while before a bound for general $G$ was
obtained that came close to $i(K_{d,d})^{n/2d}$ in the second term of the exponent; this was
Kahn's (unpublished) bound $i(G) \le 2^{n/2 + n(1+o(1))/d}$. This was improved to
$i(G) \le 2^{n/2 + n(1+o(1))/2d}$ by Galvin [4]. Finally Zhao [13] deduced the exact bound for general
$G$ from the bipartite case.
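
The correspondence between homomorphisms, colorings and independent sets can be checked on small graphs. The following is a brute-force sketch under stated assumptions: the function names, the adjacency-matrix encoding of $H$, and the labelling of the unlooped vertex of $H_{\mathrm{ind}}$ as vertex 0 are all choices made here for illustration.

```python
from itertools import product

def hom_count(n, edges, H_adj):
    """Count maps f from {0..n-1} to V(H) sending every edge of G to an edge or loop of H."""
    h = len(H_adj)
    return sum(
        all(H_adj[f[u]][f[v]] for u, v in edges)
        for f in product(range(h), repeat=n)
    )

def complete_graph_adj(q):
    """Adjacency matrix of K_q (no loops)."""
    return [[i != j for j in range(q)] for i in range(q)]

def complete_graph_edges(n):
    return [(i, j) for i in range(n) for j in range(i + 1, n)]

def complete_bipartite_edges(d):
    """Edges of K_{d,d} with sides {0..d-1} and {d..2d-1}."""
    return [(i, d + j) for i in range(d) for j in range(d)]

# H_ind: vertex 0 has no loop, vertex 1 has a loop, and 0 is adjacent to 1.
H_IND = [[False, True], [True, True]]

for d in (1, 2, 3):
    # independent sets of K_{d,d}, to be compared with 2^(d+1) - 1
    print(d, hom_count(2 * d, complete_bipartite_edges(d), H_IND), 2 ** (d + 1) - 1)
```

With $H = K_q$ the same function counts proper $q$-colorings, and `hom_count(4, complete_graph_edges(4), complete_graph_adj(3))` illustrates that $\hom(K_{d+1}, K_q)$ vanishes when $d \ge q$.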

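The aim of this note is to obtain an asymptotic version of Conjecture 1.1 along the
lines of Galvin's upper bound on the count of independent sets in $n$-vertex, $d$-regular
graphs. Before stating the main result, we need to do some preliminary calculations.
Define

\[ \eta = \eta(q) = \begin{cases} \dfrac{q^2}{4} & \text{if $q$ is even} \\[1ex] \left\lfloor \dfrac{q}{2} \right\rfloor \left\lceil \dfrac{q}{2} \right\rceil = \dfrac{q^2-1}{4} & \text{if $q$ is odd,} \end{cases} \]

and

\[ m = m(q) = \begin{cases} \dbinom{q}{q/2} & \text{if $q$ is even} \\[1ex] \dbinom{q}{\lfloor q/2 \rfloor} + \dbinom{q}{\lceil q/2 \rceil} = \dbinom{q+1}{(q+1)/2} & \text{if $q$ is odd.} \end{cases} \]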



Fix a bipartition $\mathcal{E} \cup \mathcal{O}$ of $K_{d,d}$. The set of $q$-colorings of $K_{d,d}$ may be written as
$\bigcup_{(A,B)} C(A,B)$ where the union is over ordered pairs $(A,B)$ with $|A|, |B| > 0$ and
$|A \cap B| = 0$, and where $C(A,B)$ consists of colorings in which the set of colors appearing on
$\mathcal{E}$ (resp. $\mathcal{O}$) is exactly $A$ (resp. $B$). Since each partite set has $d$ vertices,
inclusion-exclusion easily gives $|C(A,B)| = (|A||B|)^{d} + O((|A||B|-1)^{d})$.
The maximum possible value of $|A||B|$ is $\eta(q)$ (achieved
when $A \cup B = \{1, \dots, q\}$ and $|A| - |B| \in \{-1, 0, 1\}$), and there are $m(q)$ pairs that
achieve this value. It follows that

\[ \left| c_q(K_{d,d}) - \eta^d m \right| \le (\eta - c)^d \]

for some $c = c(q) > 0$. This leads to

\[ c_q(K_{d,d})^{n/2d} = \eta^{n/2} m^{n(1+o(1))/2d} \tag{1} \]

where (here and everywhere) $o(1) \to 0$ as $d \to \infty$.
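
The decomposition over pairs $(A, B)$ can be turned into an exact count and compared with the leading term $\eta^d m$. The sketch below is an illustration only: the function names are hypothetical, and it assumes that each side of $K_{d,d}$ has $d$ vertices, so that the colorings with color sets exactly $A$ and $B$ are counted by a product of two surjection numbers.

```python
from math import comb

def surjections(d, a):
    """Number of maps from a d-set onto an a-set, by inclusion-exclusion."""
    return sum((-1) ** j * comb(a, j) * (a - j) ** d for j in range(a + 1))

def colorings_Kdd(d, q):
    """c_q(K_{d,d}): sum over sizes (a, b) of disjoint non-empty color sets A, B."""
    return sum(
        comb(q, a) * comb(q - a, b) * surjections(d, a) * surjections(d, b)
        for a in range(1, q)
        for b in range(1, q - a + 1)
    )

def eta(q):
    """floor(q/2) * ceil(q/2), the largest possible value of |A||B|."""
    return (q // 2) * ((q + 1) // 2)

def m(q):
    """Number of ordered pairs (A, B) with |A||B| = eta(q)."""
    return comb(q, q // 2) if q % 2 == 0 else 2 * comb(q, q // 2)

for d in (2, 5, 10, 20):
    # the ratio c_q(K_{d,d}) / (eta^d m) should approach 1 as d grows
    print(d, colorings_Kdd(d, 3) / (eta(3) ** d * m(3)))
```

For $q = 3$ the ratio approaches 1 geometrically fast in $d$, which is the content of the estimate preceding (1).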

Our main theorem is an upper bound on $c_q(G)$ for all $G \in \mathcal{G}(n,d)$ that matches (1)
up to the $o(1)$ term.

Theorem 1.3 Fix $d \ge 2$ and $n \ge d+1$. For any $G \in \mathcal{G}(n,d)$ and any $q \ge 3$,

\[ c_q(G) \le \eta^{n/2} m^{n(1+o(1))/2d}. \]

The best previous result in this direction was from [3], where it was shown that

\[ c_q(G) \le \eta^{n/2} m^{n\left(1-\frac{1}{q}\right)(1+o(1))/d}. \]

(This only appears explicitly in [3] for $q = 3$, but follows immediately for general $q$
from Proposition 1.4 below by taking $\alpha = n/q$; note that for all smaller $\alpha$, $c_q(G) = 0$.)

To prove Theorem 1.3 we consider the independence number $\alpha(G)$ of $G$, the number
of vertices in a largest independent set, and deal separately with large $\alpha(G)$ and small
$\alpha(G)$. The case of large $\alpha(G)$ has already been dealt with in [3, Section 5], where an
entropy approach is used to obtain the following result.

Proposition 1.4 Fix $d \ge 2$ and $n \ge d+1$. For any $G \in \mathcal{G}(n,d)$ with $\alpha(G) \ge n(1-\varepsilon)/2$
and any $q \ge 3$,

\[ c_q(G) \le \eta^{n/2} m^{n(1+\varepsilon)/2d} C^{n/d^2} \]

where $C = C(q) > 1$ is a constant.

To bound $c_q(G)$ when $G$ has no large independent sets we adapt an argument of
Sapozhenko to obtain the following, which we prove in Section 2.

Lemma 1.5 Fix $d \ge 2$ and $n \ge d+1$. For each $q \ge 3$, there are constants $c_1, c_2 > 0$
such that if $G \in \mathcal{G}(n,d)$ has $\alpha(G) \le n(1-\varepsilon)/2$ then

\[ c_q(G) \le \eta^{n/2} \exp_2\left\{ c_1 n \sqrt{\frac{\log d}{d}} - c_2 \varepsilon n \right\}. \]

(For concreteness, here and throughout $\log = \log_2$.) Along the way, we describe a very
simple argument that gives the weaker bound

\[ c_q(G) \le \eta^{n/2}\, 2^{O\left(n\sqrt{\log d / d}\right)} \tag{2} \]

valid for all $G \in \mathcal{G}(n,d)$. (Here and elsewhere, constants implied in $O$ or $\Omega$ statements
may depend on $q$.)

Taking $\varepsilon = C'\sqrt{\log d/d}$ for suitably large $C' = C'(q) > 0$, Proposition 1.4 and
Lemma 1.5 combine to give Theorem 1.3 in the following precise form: for any
$G \in \mathcal{G}(n,d)$ and any $q \ge 3$,

\[ c_q(G) \le \eta^{n/2} m^{\frac{n}{2d}\left(1 + O\left(\sqrt{\log d / d}\right)\right)}. \]

Lemma 1.5 together with [5, Prop. 1.2] (the bipartite case of Conjecture 1.1) also shows
that the only $G \in \mathcal{G}(n,d)$ which remain as potential counterexamples to Conjecture
1.1 are those which are non-bipartite and have an independent set of size at least
$(n/2)(1 - C\sqrt{(\log d)/d})$ for some constant $C > 0$.

A simple corollary of Lemma 1.5 is that for each fixed $\varepsilon > 0$ and $q \ge 3$ there is
$\delta > 0$ such that for all $d \ge 2$, $n \ge d+1$ and $G \in \mathcal{G}(n,d)$ with $\alpha(G) \le n(1-\varepsilon)/2$, we
have

\[ c_q(G) \le (\eta - \delta)^{n/2}. \]

A natural question to ask is how $\delta$ (more precisely, the supremum over all $\delta$ for which
the preceding statement is true) varies with $\varepsilon$ in the range $0 < \varepsilon < 1 - (2/q)$. At $\varepsilon = 0$
we have $\delta = 0$ (by Theorem 1.3 and the example of the disjoint union of $K_{d,d}$'s), and
from the fact that $c_q(G) = 0$ whenever $\alpha(G) < n/q$ we conclude that $\delta = \eta$ for all
$\varepsilon > 1 - (2/q)$.

Question 1.6 Fix $d \ge 2$, $n \ge d+1$, $q \ge 3$ and $0 < \varepsilon < 1 - (2/q)$. What is the
maximum of $c_q(G)$ over all $G \in \mathcal{G}(n,d)$ with $\alpha(G) \le n(1-\varepsilon)/2$?



2 Proof of Lemma 1.5: Small independent sets



To obtain (2) we modify an argument due to Sapozhenko [11], originally used to
enumerate independent sets in a regular graph; a further modification of this argument
will give Lemma 1.5.

Let $\varphi = \sqrt{d \log d}/q$ (note that $\varphi \le d$). For an independent set $I$ in $G$, recursively
construct sets $T(I)$ and $D(T)$ as follows. Pick $u_1 \in I$ and set $T_1 = \{u_1\}$. Given
$T_m = \{u_1, \dots, u_m\}$, if there is $u_{m+1} \in I$ with $|N(u_{m+1}) \setminus N(T_m)| \ge \varphi$, then set
$T_{m+1} = \{u_1, \dots, u_{m+1}\}$ (here $N(\cdot)$ indicates open neighborhood). If there is no such $u_{m+1}$, then
set $T = T_m$ and

\[ D(T) = \{ v \in V(G) \setminus N(T) : |N(v) \setminus N(T)| < \varphi \}. \]



Note that

\[ |T| \le \frac{n}{\varphi}, \tag{3} \]

since by construction $n \ge |N(T)| \ge (|T| - 1)\varphi + d \ge |T|\varphi$; that

\[ I \subseteq D \tag{4} \]

since if $I \setminus D \ne \emptyset$, the construction of $T$ would not have stopped (note that $N(T) \cap I = \emptyset$);
and that

\[ |D| \le \frac{nd}{2d - \varphi} \le \frac{n}{2}\left(1 + \frac{\varphi}{d}\right). \tag{5} \]

The second inequality here follows from $\varphi \le d$. To see the first, consider the bipartite
graph with partition classes $D$ and $N(T)$ and edges induced from $G$. This graph has
at most $d|N(T)| \le d(n - |D|)$ edges (since each vertex in $N(T)$ has at most $d$ edges to
$D$, and there are at most $n - |D|$ such vertices), and at least $(d - \varphi)|D|$ edges (since
each vertex in $D$ has at least $d - \varphi$ edges to $N(T)$). Putting these two inequalities
together gives $(d - \varphi)|D| \le d(n - |D|)$, which rearranges to the first inequality in (5).
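
The greedy construction of $T$ and $D$ and the three properties (3), (4), (5) can be tried out on a concrete regular graph. The following is a minimal sketch, assuming the threshold rule "add a vertex of $I$ while it has at least $\varphi$ neighbours outside $N(T)$"; the function names, the choice of the hypercube as test graph, and the demo value $\sqrt{d \log_2 d}/q$ for the threshold are assumptions made here for illustration.

```python
import math

def hypercube(d):
    """Adjacency sets of the d-dimensional hypercube, a d-regular graph on 2^d vertices."""
    return {v: {v ^ (1 << i) for i in range(d)} for v in range(2 ** d)}

def build_T_and_D(adj, I, phi):
    """Greedy construction of T = T(I) and D = D(T) for a non-empty independent set I."""
    I = sorted(I)
    T = [I[0]]
    covered = set(adj[I[0]])            # N(T), the union of open neighbourhoods
    while True:
        # look for a vertex of I that still has at least phi neighbours outside N(T)
        nxt = next((u for u in I if len(adj[u] - covered) >= phi), None)
        if nxt is None:
            break
        T.append(nxt)
        covered |= adj[nxt]
    # D: vertices outside N(T) with fewer than phi neighbours outside N(T)
    D = {v for v in adj if v not in covered and len(adj[v] - covered) < phi}
    return T, D

d, q = 6, 3
adj = hypercube(d)
n = len(adj)
phi = math.sqrt(d * math.log2(d)) / q                # assumed threshold value
I = [v for v in adj if bin(v).count("1") % 2 == 0]   # even-weight vertices are independent
T, D = build_T_and_D(adj, I, phi)
print(len(T), len(D), n / phi, n * d / (2 * d - phi))
```

The printed sizes of $T$ and $D$ can be compared with the bounds $n/\varphi$ and $nd/(2d - \varphi)$; the point of the construction is that the short list $T$ determines the container $D$ of $I$.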

Now a $q$-coloring of $G$ is an ordered partition of $V(G)$ into $q$ independent sets
$(I_1, \dots, I_q)$, with $I_k$ being the set of vertices colored $k$. Following Sapozhenko's
argument, we associate with this partition an ordered list $(D(T(I_1)), \dots, D(T(I_q)))$. We
recover all $q$-colorings of $G$ (and perhaps more) by finding all such lists, and then for
each list $(D_1, \dots, D_q)$ finding all ordered partitions of $V(G)$ into $q$ sets $(I_1, \dots, I_q)$
(not necessarily independent sets), with $I_k \subseteq D_k$ for each $k$. We say that such a
partition is compatible with the $D_k$'s.

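By (3) each possible $D_k$ is determined by a set of size at most $n/\varphi$ and so the
number of choices for $(D_1, \dots, D_q)$ is at most

\[ \left( \sum_{i \le n/\varphi} \binom{n}{i} \right)^{q} = 2^{O\left(n\sqrt{\log d/d}\right)}, \tag{6} \]

the equality using standard binomial estimates. We now bound the number of
partitions compatible with a particular $(D_1, \dots, D_q)$. For each $v \in V(G)$ let $a_v$ denote the
number of $D_k$'s with $v \in D_k$. Using (5) we have

\[ \sum_{v \in V(G)} a_v = \sum_{k=1}^{q} |D_k| \le \frac{nq}{2}\left(1 + \frac{\varphi}{d}\right). \tag{7} \]

The number of compatible partitions is $\prod_{v \in V(G)} a_v$, since each vertex $v$ may be placed in any
of the $a_v$ sets $D_k$ that contain it. By the AM-GM inequality we get

\[ \prod_{v \in V(G)} a_v \le \left( \frac{q}{2}\left(1 + \frac{\varphi}{d}\right) \right)^{n} \le \left(\frac{q}{2}\right)^{n} 2^{O\left(n\sqrt{\log d/d}\right)}. \tag{8} \]

Combining (6) and (8) we get (2) for even $q$.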
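
The counting step behind the AM-GM bound can be illustrated on a toy instance: an ordered partition compatible with $(D_1, \dots, D_q)$ places each vertex $v$ in one of the sets containing it, so the number of such partitions is the product of the multiplicities $a_v$. The sketch below checks this by brute force; the function names and the example sets are hypothetical.

```python
from itertools import product
from math import prod

def compatible_partitions(n, Ds):
    """Brute force: assignments v -> k (k indexes Ds) with v in Ds[k] for every vertex v."""
    q = len(Ds)
    return sum(
        all(v in Ds[k] for v, k in enumerate(assign))
        for assign in product(range(q), repeat=n)
    )

def multiplicities(n, Ds):
    """a_v = number of sets D_k containing v, for v = 0..n-1."""
    return [sum(v in D for D in Ds) for v in range(n)]

n = 4
Ds = [{0, 1, 2}, {1, 2, 3}, {0, 3}]      # toy example, not derived from a graph
a = multiplicities(n, Ds)
print(compatible_partitions(n, Ds), prod(a), (sum(a) / n) ** n)
```

The first two printed numbers agree, and the third is the AM-GM upper bound $(\sum_v a_v / n)^n$.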

We now work towards a better bound that incorporates the independence number
of $G$. Since we are upper bounding $c_q$ we may assume, by adding vertices in some
deterministic way if necessary, that $D_1$ satisfies $|D_1| \ge n/2$. Now we look at the
subgraph induced by $D_1$. It inherits from $G$ the property that no independent set has
size greater than $(n/2)(1 - \varepsilon)$. This means that $D_1$ has a matching of size at least $n\varepsilon/4$
(which may be found greedily).
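
The greedy step can be sketched as follows. In a maximal matching of the subgraph induced by $D_1$ the unmatched vertices form an independent set, so a matching $M$ found greedily satisfies $|D_1| - 2|M| \le \alpha(G)$, which is where the lower bound on the matching size comes from. The code is an illustration with hypothetical names, run on a 9-cycle as a stand-in for the induced subgraph.

```python
def greedy_matching(adj, D):
    """Greedily build a maximal matching in the subgraph induced by the vertex set D."""
    matched, M = set(), []
    for u in sorted(D):
        if u in matched:
            continue
        for v in sorted(adj[u]):
            if v in D and v not in matched:
                M.append((u, v))          # match u with its first free neighbour in D
                matched |= {u, v}
                break
    return M

def unmatched_is_independent(adj, D, M):
    """Check that the vertices of D missed by M are pairwise non-adjacent."""
    matched = {x for edge in M for x in edge}
    free = set(D) - matched
    return all(not (adj[u] & free) for u in free)

# Stand-in example: the cycle C_9, whose independence number is 4.
cycle9 = {i: {(i - 1) % 9, (i + 1) % 9} for i in range(9)}
M = greedy_matching(cycle9, set(range(9)))
print(M, unmatched_is_independent(cycle9, range(9), M))
```

On this example the matching has four edges, comfortably above the bound $(9 - 4)/2$ given by the argument.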

Fix such a matching $M = \{x_1y_1, \dots, x_{|M|}y_{|M|}\}$. In our naive count of colorings,
we had a factor $a_{x_1}a_{y_1}$ to account for the possible colors assigned to $x_1$ and $y_1$ in a
compatible partition. But since $x_1$ and $y_1$ are adjacent, we cannot assign color 1 to
both vertices, and so we have at most

\[ a_{x_1}a_{y_1} - 1 = a_{x_1}a_{y_1}\left(1 - \frac{1}{a_{x_1}a_{y_1}}\right) \le a_{x_1}a_{y_1}\left(1 - \frac{1}{q^2}\right) \]

choices for this pair. Applying this argument to each of the pairs $(x_i, y_i)$, we get an
upper bound on the number of colorings compatible with $(D_1, \dots, D_q)$ of

\[ \left( \prod_{v \in V(G)} a_v \right)\left(1 - \frac{1}{q^2}\right)^{|M|} \le \eta^{n/2}\, 2^{O\left(n\sqrt{\log d/d}\right) - \Omega(\varepsilon n)}, \]

using the AM-GM inequality for the first term in the product, and our lower bound on
$|M|$ for the second. Combining with (6) we obtain Lemma 1.5 for even $q$.

Now we turn to odd $q$. Proceeding exactly as before, we have

\[ c_q(G) \le \left( \prod_{v \in V(G)} a_v \right) 2^{O\left(n\sqrt{\log d/d}\right) - \Omega(\varepsilon n)}, \]

so we are done (both with Lemma 1.5 and with (2) in the case of odd $q$) if we can
bound

\[ \prod_{v \in V(G)} a_v \le \left( \lfloor q/2 \rfloor \lceil q/2 \rceil \right)^{n/2} 2^{O\left(n\sqrt{\log d/d}\right)}. \tag{9} \]

For this we need the following optimization lemma:

Lemma 2.1 Let $a_1, \dots, a_m$ be positive real numbers with average $a$. If there is a $\delta > 0$
such that no $a_i$ is in the interval $(a - \delta, a + \delta)$, then

\[ \prod_{i=1}^{m} a_i \le (a^2 - \delta^2)^{m/2} = (a - \delta)^{m/2} (a + \delta)^{m/2}. \]

Proof. We begin with $m$ even, say $m = 2k$. Without loss of generality, assume
$a_1 \le \dots \le a_m$. Let $\mathrm{ave}_1$ be the average of $a_1$ through $a_k$ and $\mathrm{ave}_2$ the average of $a_{k+1}$ through
$a_m$; clearly $\mathrm{ave}_1 \le \mathrm{ave}_2$, and $\mathrm{ave}_1 + \mathrm{ave}_2 = 2a$, so $\mathrm{ave}_1 = a - \delta'$ and $\mathrm{ave}_2 = a + \delta'$ for
some $\delta' \ge 0$. We claim that $\delta' \ge \delta$. If not, then $a_k$ (being at least $\mathrm{ave}_1$) is greater than
$a - \delta$, and so by hypothesis is at least $a + \delta$, which forces $\mathrm{ave}_2$ to be at least $a + \delta$, a
contradiction since $\mathrm{ave}_2 = a + \delta' < a + \delta$.

Armed with the information that $\delta' \ge \delta$, we apply the AM-GM inequality to $a_1$
through $a_k$ and $a_{k+1}$ through $a_m$ separately and get

\[ \prod_{i=1}^{m} a_i \le (a - \delta')^{k} (a + \delta')^{k} = (a^2 - \delta'^2)^{m/2} \le (a^2 - \delta^2)^{m/2}. \]

To deal with odd $m$, we consider the problem of maximizing $\prod_{i=1}^{m} a_i \prod_{i=1}^{m} a'_i$ subject
to the conditions that no $a_i$ or $a'_i$ lies in the interval $(a - \delta, a + \delta)$, and that the average
of the $2m$ numbers is $a$. By the even case, the maximum is at most $(a^2 - \delta^2)^{m}$. This
remains an upper bound on the maximum if we add the conditions $a_i = a'_i$ for each
$i = 1, \dots, m$; but then the maximum becomes the square of the maximum of $\prod_{i=1}^{m} a_i$,
and we are done. □
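
Lemma 2.1 is easy to probe numerically. The sketch below (an added illustration with hypothetical names) takes a list of positive numbers, sets $\delta$ to the smallest distance from any entry to the average, so that no entry lies in the open interval $(a - \delta, a + \delta)$, and compares the product with $(a^2 - \delta^2)^{m/2}$ up to a small floating-point tolerance.

```python
import random
from math import prod

def lemma_2_1_holds(values, tol=1e-9):
    """Check prod(values) <= (a^2 - delta^2)^(m/2), with delta the gap around the average a."""
    m = len(values)
    a = sum(values) / m
    delta = min(abs(x - a) for x in values)   # no value lies strictly inside (a - delta, a + delta)
    bound = (a * a - delta * delta) ** (m / 2)
    return prod(values) <= bound * (1 + tol)

random.seed(0)
samples = [[random.randint(1, 9) for _ in range(random.randint(1, 8))] for _ in range(200)]
print(all(lemma_2_1_holds(s) for s in samples))
print(lemma_2_1_holds([1, 3]))   # equality case: average 2, delta 1, product 3
```

When some entry equals the average, $\delta = 0$ and the check reduces to the usual AM-GM inequality.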

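To apply Lemma 2.1 we first assume (as we may do without loss of generality) that
each $D_i$ satisfies $|D_i| \ge n/2$. This assumption together with (5) and our specific choice
of $\varphi$ gives that the average of the $a_v$'s satisfies $a \in [q/2,\ q/2 + (1/2)\sqrt{\log d/d}]$. Since
the $a_v$'s must be integers, and $\sqrt{\log d/d} < 1$, we may take $\delta = (1/2)(1 - \sqrt{\log d/d})$ in
Lemma 2.1 to get

\[ \prod_{v \in V(G)} a_v \le \left( \left( \lfloor q/2 \rfloor + \sqrt{\frac{\log d}{d}} \right) \lceil q/2 \rceil \right)^{n/2} = \left( \lfloor q/2 \rfloor \lceil q/2 \rceil \right)^{n/2} 2^{O\left(n\sqrt{\log d/d}\right)}, \]

as required.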